Wenhu Chen

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

May 26, 2026
Via arXiv

Starve to Perceive: Taming Lazy Perception in VLMs with Constrained Visual Bandwidth

May 18, 2026
Via arXiv

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

May 13, 2026
Via arXiv

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

May 11, 2026
Via arXiv

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Apr 27, 2026
Via arXiv

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Apr 27, 2026
Via arXiv

MMEB-V3: Measuring the Performance Gaps of Omni-Modality Embedding Models

Apr 25, 2026
Via arXiv

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Apr 14, 2026
Via arXiv

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Apr 09, 2026
Via arXiv

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Mar 29, 2026
Via arXiv